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ABSTRACT 

Since 1997 the National Center for Documentation and Evaluation of Alternative Methods 
to Animal Experiments, ZEBET, in Berlin, has been coordinating a validation study aimed 
at prevalidation and validation of three in vitro embryotoxicity tests, funded by the Eu- 
ropean Center for the Validation of Alternative Methods (ECVAM) at the Joint Research 
Center (JRC, Ispra, Italy). The tests u&e the cultivation of postimplantation rat whole em- 
bryos CWEC test), cultures of primary limb bud cells of rat embryos (micromass or, MM, 
test), and cultures of a plurlpotent mouse embryonic stem cell line (embryonic stem cell 
test or EST). Each of the tests was performed in four laboratories under blind conditions. 
In the preliminary phase of the validation study 6 out of 20 test chemicals comprising 
different embryotoxic potential (non, weakly, and strongly embryotoxic} were tested. The 
results were used to define biostatistically based prediction models (PMs) to identify the 
embryotoxic potential of test chemicals for the WEC test and the MM test. The PMs de- 
veloped with the results of the preliminary phase of the validation study (training set) 
will be evaluated with the results of the remaining 14 test chemicals (definitive phase) by 
the end of the study. In addition, the existing, improved PM (iPM) for the EST, which had 
been defined previously, was evaluated using the results of the preliminary phase of this 
study. Applying the iPM of the EST to the results of this study, in 79% of the experiments, 
chemicals were classified correctly according to the embryotoxic potential defined by in 
vivo testing. For the MM and the WEC test, the PMs developed during the preliminary 
phase of this validation study provided 81% (MM test) and 72% (WEC test) correct clas- 
sifications. Because the PM of the WEC test took into account only parameters of growth 
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and development, but not cytotoxicity data, a second PM (PM2) was developed for the 
WEC test by incorporating cytotoxicity data of the differentiated mouse fibroblast cell line 
3T3, which was derived from the EST. This approach, which has previously never been 
used, resulted in an increase to 84% correct classifications in the WEC test. 



INTRODUCTION 

For industrial chemicals within the European 
Union (EU), no information is available on 
reproductive toxicity for the majority of base set- 
chemicals, which are produced at less than one 
ton per year or less than five tons in total. There- 
fore, EU regulators would appreciate better in- 
formation on chemicals exhibiting reproductive 
toxicity early on and they are even prepared to 
apply in vitro embryotoxicity tests for regulatory 
purposes. However, as yet, there is no agree- 
ment on which screening test should be used. 
Moreover, in the field of drug development, in- 
dustry is using in vitro tests in high throughput 
screening (HTS) to identify the most promising 
new compounds. In HTS the identification of 
toxic properties of new compounds has a high 
priority early on, even before in vivo testing in 
animals or humans is started. Because adverse 
effects on reproduction are among the most haz- 
ardous side effects of drugs, there is an in- 
creasing demand to include in vitro embry- 
otoxicity tests in HTS systems for selecting "lead 
chemicals" in drug development. 

For the reasons described European Cen- 
ter for the Validation of Alternative Methods 
(ECVAM) has held a workshop on screening 
chemicals for reproductive toxicity (Brown et 
al., 1995). According to the recommendations of 
this workshop, there is a need to validate in 
vitro tests in reproductive toxicology. Although 
several in vitro tests have been proposed as tests 
for developmental toxicity, none of them has 
successfully been validated to date. The ECVAM 
workshop therefore recommended that further 
work should be directed to improve and vali- 
date existing in vitro embryotoxicity tests and 
to introduce new ones. According to the defin- 
ition of the EU, the Organization for Economic 
Co-operation and Development (OECD) and the 
U.S. NIEHS (National Institute of Environmental 
Health Sciences) (Balls et al., 1990, 1995; NIEHS, 
1997; OECD, 1996), validation is the process by 



which the relevance, reliability and predictabil^. 
ity of alternative toxicological testing methods 
is assessed to reduce, refine, or replace the use 
of laboratory animals. The main purpose of a 
validation study is to conduct an interlaboratory 
blind trial that should comprise a preliminary 
phase, in which a small number of coded chem- 
icals, or "training set," is tested, and a definite 
phase, followed by data analysis and evaluation 
of the outcome of the study. 

Biostatistical methods should be used 
throughout development and validation of al- 
ternative test methods. Hence, demonstration of 
reliability and relevance of an alternative method 
not only implicates the assessment of the vari- 
ability of data, but also involves the establish- 
ment of PMs, which are able to predict the toxic 
effects likely to occur in vivo from the results . 
obtained in the in vitro system. 

PMs were established in the preliminary phase 
of the validation study of three of the most 
promising in vitro tests in a ring trial funded by 
the ECVAM: the rat whole embryo culture test 
(WEC test), the micromass test (MM test), and 
the embryonic stem cell test (EST). It is the goal 
of the ECVAM validation project on in vitro em- 
bryotoxicity tests to finally determine whether 
currently available in vitro methods are capable 
of predicting in vivo embryotoxic potential. Here 
we present the first results obtained with the 
PMs of the WEC and the MM test and report on 
the evaluation of the iPM of the EST using the 
results of the six test chemicals of the prelimi- 
nary phase of the formal validation study con- 
ducted under blind conditions. 

MATERIALS AND METHODS 

Detailed standard protocols of the three in 
vitro embryotoxicity tests will be published else- 
where and will be submitted to INVITTOX, the 
ECVAM databank of in vitro techniques in tox- 
icology. 
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Embryonic Stem Cell Test 

The EST was performed as described previ- 
ously (Scholz et al., 1999; Spielmann et aL, 
1997), and is an update of INVTTTOX protocol 
no. 113 (1996). Principally, the test is composed 
of two procedures, a cytotoxicity test that is con- 
ducted both with the mouse embryonic stem 
(ES) cell line D3 (Doetschman et al., 1985) and 
cells of the differentiated mouse fibroblast cell 
line 3T3, and a differentiation assay using D3 
cells. For the cytotoxicity test, cells were seeded 
into each well of a 96-well microtiter plate and 
grown in the presence of a range of concen- 
trations of test chemical. A negative control con- 
taining solvent diluted in medium was also in- 
cluded. After 10 days of culture with 2 changes 
of medium (containing the appropriate concen- 
tration of test chemical/solvent) on Days 3 and 
5, the viability of cells was determined using the 
MTT test (Mosmann, 1983). This test is based 
on the ability of mitochondrial dehydrogenases 
to convert the yellow substrate MTT into a dark 
blue formazan product, which is detected quan- 
titatively using a microplate enzyme-linked irn- 
munoadsorbent assay (ELISA) reader. Cytotoxi- 
city is expressed as the concentration inhibiting 
growth by 50% of the control level (IC 50 3T3 and 
IC 50 D3; determined from a concentration re- 
sponse curve). 

For the differentiation assay, D3 cells were 
grown for 3 days in a "hanging drop" culture in 
the presence of a concentration range of the test 
chemical. During this period, D3 cells formed 
aggregates called embryoid bodies (EBs). After 
3 days of culture EBs were transferred to bac- 
terial petro dishes. On Day 5 EBs were plated 
into wells of a 24-well tissue culture plate to al- 
low adherence and outgrowth of the EBs and 
development of contracting cardiac muscle cells. 
Differentiation was determined by microscopic 
inspection of EBs on Day 10. The number of 
wells containing contracting cardiac muscle cells 
was determined for each plate and compared 
with the number of wells containing contract- 
ing cardiac muscle cells on a solvent control 
plate. The inhibition of differentiation (ID 50 ) is 
expressed as the concentration of test chemical 
inhibiting the development of contracting car- 
diac muscle cells by 50% (calculated from a con- 
centration response curve). 



Micromass Test 

The MM test is modified from the INVITTOX 
protocol no. 114 (1996) and uses cultures of dis- 
sociated cells of limb buds of rat embryos 
(Amacher et al., 1996 Flint and Orton, 1984; 
Kistler, 1987; Uphill et al., 1990). On Day 14 of 
gestation, embryos were prepared from preg- 
nant rats. Limb buds were dissected under the 
microscope. After dissociation the cells were 
seeded into 96-well plates at high density spots 
("micromass") in an appropriate culture 
medium, which allows differentiation into chon- 
drocytes. After 5 days of culture in the presence 
of a series of concentrations of test chemicals, 
growth and differentiation of limb bud cells was 
evaluated. The extent of differentiation into 
chondrocytes was assessed by staining with the 
chondrocyte specific dye Alcian Blue (AB). The 
cytotoxic effect on the same cells was deter- 
mined by staining with Neutral Red (NR). The 
concentrations inhibiting AB staining (differen- 
tiation) by 50% in comparison to unexposed 
controls (ID 50 ) and the concentrations reducing 
NR staining by 50% (IC 50 ) were estimated from 
concentration response curves. 

Rat Whole Embryo Culture Test (WEC) 

The test protocol was derived from a previ- 
ous European interlaboratory validation study 
(Piersma et al., 1995, 1996). On Day 10 of ges- 
tation, rat embryos at the 1-5 somite stage were 
prepared from pregnant females. At this early 
stage embryos are usually more sensitive to 
xenobiotics than older ones. During 48 h of cul- 
ture, major steps of organogenesis occur, in- 
cluding heart development, closure of the neural 
tube, development of ear and eye, branchial 
bars, and limb buds. Interference with devel- 
opment during this period may lead to general 
retardation of growth and development or to 
specific malformations in one or several organ 
anlagen. Embryos were cultured in rotating con- 
tainers. At the end of culture, the morphology 
of the embryos was carefully assessed. Com- 
parison of control embryos to exposed ones was 
the measure for determining the embryotoxicity 
of test compounds. Morphological, develop- 
mental, functional, and growth parameters were 
scored according to a modified scoring system 
first defined by Brown and Fabro (1981). Data 



54 



GENSCHOW ET AL. 



from seven embryos was collected for each con- 
centration and the solvent control. The mean to- 
tal morphological score (TMS; equal to the sum 
of all scores of the scoring system) was evalu- 
ated for each concentration in comparison to 
the mean TMS of the control group, which is 
set to 100%. In addition, number and specificity 
of malformations were recorded for each group 
of embryos. In the present report, the rate of 
malformed embryos of each concentration 
group, irrespective of the number of malforma- 
tions of individual embryos was evaluated in 
comparison to the control group. Results were 
assessed by determining no effect concentra- 
tions (ICmoecX 50% effect concentrations (IC 5 o) 
and maximum effect concentrations (IC max ) from 
concentration response curves. 

Test Chemicals 

During the formal validation study, 20 chem^ 
icals and the positive control 5-Fluorouracil (5- 



FU) were tested under blind conditions. To pro- 
vide a balanced representation of non, weakly, 
and strongly embryotoxic chemicals in the study, 
as well as a sufficient number of test chemicals, 
it was decided to test six strongly embryotoxic, 
seven weakly embryotoxic, and 7 nonembry- 
otoxic chemicals (Table 1). The test chemicals 
were selected in a subcontract during the preval- 
idation phase of the study, on condition that the 
evaluated in vivo data and in vitro data from 
animals and humans were of high quality. The 
final selection and classification of test chemi- 
cals was critically reviewed and approved by the 
ECETOC (European Chemical Industry Ecology 
& Toxicology Centre, Brussels, Belgium) "Task 
Force on Reproductive Toxicology." The chem- 
ical selection study will be published in the fi- 
nal report of the formal validation study. 

Coding and shipment of test chemicals was 
performed by RCC-Cytotest Cell Research (RojS 
dorf, D). The test chemicals were purchased 
from Merck (Darmstadt, Germany) and Sigma 



Table 1. Twenty Selected Test Chemicals Used 


in the ECVAM Validation Study 3 


Test chemical 


CAS No. 


Company 


Cat. No. 




In vivo strongly embryotoxic 




5-Bromo-2'-deoxyuridine 


59-14-3 


Sigma 


B9285 


Methyl mercury chloride 


115-09-3 


Aldrich 


442534 


Hydroxyurea 


127-07-1 


Sigma 


H8627 


Methotrexate 


59-05-2 


Sigma 


A6770 


All-trans-retinoic acid 


302-79-4 


Roche 


Dr. H. Biirgin 


6-Aminonicotinamide 


329-89-5 


Sigma 


A0630 




In vivo weakly embryotoxic 




Boric aci<J 


10043-35-3 


Sigma 


B7660 


Pentyl-4-yh-VPA 




TiHo Hannover 


Prof. H. Nau 


Valproic acid 


99-66-1 


Merck 


8.14439.0025 


Lithium chloride 


7447-41-8 


Sigma 


L4408 


Dimethadione 


695-53-4 


Sigma 


D7631 


Methoxyacetic acid 


625-45-6 


Merck 


8.21532 


Salicylic acid sodium salt 


54-21-7 


Sigma 


S3007 




In vivo nonembryotoxic 




Acrylamide 


79-06-1 


Sigma 


A9099 


Isobutyl-ethyl-VPA 




TiHo Hannover 


Prof. H. Nau 


D-(+)-Camphor 


464-49-3 


Sigma 


C9380 


Dimethyl phthalate 


131-11-3 


Merck 


8.00918 


Diphenhydramine hydrochloride 


147-24-0 


Sigma 


D3630 


Penicillin G sodium salt 


69-57-8 


Sigma 


PEN-NA 


Saccharin sodium hydrate 


82385-42-0 


Sigma 


S1002 



a (Smith et aL, 1983; Shepard, 1992; Spielmann, 1998.) 
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(Deisenhofen, Germany) except the following 
ones: all-trans-retinoic acid (RA) was supplied 
by Dr. Heinrich Biirgin (Hoffmann La Roche, 
Basel, Switzerland), isobutyl-ethyl-VPA and 
pentyl-4-yn-VPA was donated by Prof. Heinz 
Nau (Zentrumsabteilung fur Lebensmittel- 
toxikologie der Tierarztlichen Hochschule Han- 
nover, Germany). 

As is common practice in validation studies 
under blind conditions, all laboratories have to 
assume that each test chemicals may be severely 
toxic, and sensitive to temperature and UV ir- 
radiation. Thus, all chemicals had to be handled 
according to the most appropriate safety regu- 
lations. 

Biostatistical Analysis 

The statistical data analysis focused on the de- 
velopment of PMs for the WEC test and the MM 
test and for the assessment of the quality of the 
iPM model developed for the EST (Scholz*et al., 
1999). Endpoint values were determined in par- 
ticipating laboratories by employing EXCEL 
spreadsheets, which were designed for data 
recording of each in vitro embryotoxicity test. 
Values for IC 50 > ID 50 , ICnoeo and IC max were 
calculated by estimation from concentration re- 
sponse curves and were determined in two in- 
dependent runs of the EST and MM test, and 
for mean values of a total number of at least 
seven embryos in the WEC test. Results of in- 
dividual experiments in the EST and the MM test 
were evaluated independently; in the WEC test, 
results of seven embryos were pooled and eval- 
uated as a single experiment. 

The PMs were calculated using stepwise dis- 
criminant analysis (NoruSis, 1994). 

RESULTS 

According to the recommendations of ECVAM 
for the prevalidation and validation of toxicity 
test procedures, Standard Operating Procedures 
(SOPs) of the three tests were worked out in 
the lead laboratories during the prevalidation 
phase of the study and were transferred to the 
second laboratory (Curren et al., 1995). The 
SOPs for the three in vitro embryotoxicity tests 
were successfully established in the second lab- 
oratory and reproducible results were obtained. 



As a result of the prevalidation study, the three 
methods were able to discriminate a limited set 
of uncoded test chemicals according to their em- 
bryotoxic potential (Scholz et al., 1998). 

During the formal validation study, which is 
currently under way, 20 chemicals are being 
tested under blind conditions in each of four 
European laboratories. In the preliminary pljase 
of the validation study, 6 of the 20 chemicals^ 
(two chemicals not embryotoxic in vivo, 2 
weakly embryotoxic, and 2 strongly embry- 
otoxic) were tested. 

To maintain the blind character of the study 
and to allow evaluation of the prediction model 
of the three in vitro embryotoxicity tests in the 
preliminary phase of the validation study, the 
embryotoxicity classes were decoded by the 
subcontractor for coding and shipment of the 
chemicals, RCC Cytotest Cell Research (Ro0dorf, 
Germany), without providing information on the 
identity of the chemicals. This information was 
used by the biostatistician in the present study 
on PMs. 

The remaining 14 test chemicals will be used 
for the evaluation of the PMs of the MM and 
WEC test. Because an improved PM has al- 
ready been established for the EST, all 20 test 
chemicals will be used for the evaluation of 
the iPM. 

Structure of the Validation Study 

The structure of the validation study is illus- 
trated in Fig. 1. The main contractor ZEBET 
(Center for Documentation and Evaluation of Al- 
ternative Methods to Animal Experiments at the 
BgW, Berlin, Germany) is conducting the EST 
as lead laboratory; in addition, the European 
Center for the Validation of Alternative Methods 
(ECVAM, Ispra, Italy), Novartis (Basel, Switzer- 
land), and Schering (Berlin, Germany) are par- 
ticipating with the EST. The MM test is being 
performed by the lead laboratory St. George's 
Hospital Medical School (SGHMS, University of 
London, UK) and the National Institute of 
Public Health and the Environment (RIVM, 
Bilthoven, Netherlands), KTL Finland (Kuopio, 
Finland), and Synthelabo Recherche (Gar- 
genville, France). The WEC test is being per- 
formed by the lead laboratory RIVM (Bilthoven, 
Netherlands), SGHMS (London, UK), Novartis 
(Basel, Switzerland), and AstraZeneca CTL (Mac- 
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FIG. 1. Structure of the validation study. In Fig. 1 the contractor and subcontractors are shown The sub- 
■ contractors report the results to 2EBET. ZEBET reports summaries to the sponsor ECVAM. The interde- 
pendences are marked by arrows (\VEC, rat whole embryo culture test; MM, micromass test; EST embry- 
onic stem cell test) 



clesfield, UK). The laboratories are coded A, B, 
C, and D for the WEC test, E, F, G, and H for 
the MM test, and I, J, K, and L for the EST. 

Development of Prediction Models Exemplified 
by the EST 

During a formal validation exercise, in vitro 
tests have to prove their reliability and relevance 
(Balls et al., 1990, 1995; NIEHS, 1997; OECD, 
1996). For the first requirement, reliability, indi- 
vidual laboratories have to show a sufficiently 
high level of accuracy of independently re- 
peated results. The second requirement, rele- 
vance, is estimated applying a biostatistically 
based PM. In the present study the PM should 
be able to discriminate between the three classes 
ofembryotoxicity, won, weakly and strongly em- 
bryotoxic, as determined by in vivo data. 

A PM is applied to use the results of an in 
vitro toxicity test to predict the toxic potential 



of a chemical in vivo. The goal of the present 
study was to identify variables of the PM that 
are important for distinguishing among the 
classes of embryotoxicity and to develop a pro- 
cedure for correctly classifying new chemicals. 
For the PMs used in this study, discriminant 
analysis was chosen as the mathematical model. 
Discriminant analysis allows the determination 
of algorithms to distinguish among the three 
classes of embryotoxicity. Ideally, one variable 
is sufficient to distinguish among the three 
classes. However, in most cases two or more 
variables that are analysed together are neces- 
sary to distinguish between the three classes ad- 
equately (Backhaus et al., 1996). This procedure 
was used for the three in vitro embryotoxicity 
tests. 

First, the concentration response curves from 
single experiments of several test chemicals 
were used to identify potential endpoints and 
variables. As an example, the endpoints and the 
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FIG. 2. Endpoints and variables used in the improved prediction model (iPM). Diagram shows concen- 
tration response curves for the three endpoints cytotoxicity of D3 and 3T3 and inhibition of differentiation 
of D3 cells. The curves were obtained from single experiments using the positive control chemical 5-FU. 
Endpoints and variables are marked by arrows. 



variable applied in the improved PM (iPM) of 
the EST are shown in Fig. 2, which displays the 
results of testing the positive control chemical 
5-FU. 

The three endpoints (IC5o3T3; IC 50 D3; and 
ID50) cannot be used directly as variables in the 
linear analysis of discriminance. To convert the 
numerical value of the endpoint into a linear 
model, the logarithm of the 50% inhibition con- 
centrations was calculated. The term "endpoint" 
is used only for the 50% growth inhibition con- 
centration (IC 5 o3T3, IC 50 D3) and the 50% differ- 
entiation inhibition concentration (ID^q), calcu- 
lated from the concentration response curve. 
The term "variable" is applied to all mathemat- 



ical calculations Oogarithm, relative distance) of 
the original endpoints. 

EST 

Improved Prediction Model. The concentration 
response curves of two independent, valid ex- 
periments of the positive control chemical 5-FU 
are shown in Fig. 2. The endpoints and the vari- 
able are marked by arrows. To allow inclusion 
of occasionally divergent test results for viabil- 
ity and differentiation in the model, the relative 
distance between the two endpoints IC503T3 and 
ID50 was used as a variable. This variable is in- 
dependent of the absolute concentration values, 



Table 2. Linear Discriminant Functions I, II, and III of the EST 



Function I 



Function II 



Function III 



5.92 Ig(IC 50 3T3) + 3.50 lg(IC 50 D3) - 5.31 lCl ^J T ^° ~ ^ 
3.65 IgClC 5 o3T3) + 2.39 Ig(IC 50 D3) - 2.03 IC ^c^3^ " 6 85 
-0.125 Ig(IC 50 3T3) - 1-92 Ig(IC 50 D3) + 1.50 ^^^^ - 2.67 
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because this information is already given by the 
two other variables, the IgdC 50 3T3) and the 
Ig(IC 50 D3). Thus, the in vitro test provided a to- 
tal of five different variables (Ig(IC 50 3T3); 
Ig(IC5oD3);,Ig(ID 3 o); relative distance between 
IC 50 3T3 and ID 50 ; ratio of IC503T3 and ID 50 ) each 
of which may contribute to distinguish between 
classes of embryotoxicity. A stepwise selection 
of variables was performed using analysis of 
discriminance (procedure: "stepwise" of SPSS; 
Norusis, 1994). In a stepwise fashion, each of 
the variables is added separately to the -model 
and rejected again, if it does not improve the 
separation of three classes of embryotoxicity sig- 
nificantly. Three of the variables were accepted 
in the analysis. For the EST, the linear discrim- 
inant functions incorporating the three variables 
are given in Table 2. 

The following procedure was applied to pre- 
cisely classify the chemicals according to the PM: 
The IC5o3T3, the IC50D3, and ID50 were deter- 
mined and used in the three linear discrimfnant 
functions. If the result of function I exceeds the 
results of functions II and III, the chemical is 
classified nonembryotoxic, if the result of func- 
tion II exceeds the results of functions I and III, 
the chemical is classified weakly embryotaxic. 
Finally, if the result of function III exceeds the 
results of functions I and II, the chemical is clas- 
sified to be strongly embryotoxic. 

The classification of the training set of chem- 
icals (for more details see Scholz et. al., 1999) 
according to the iPM resulted in 93% correct 
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classifications (accuracy; Table 3a). Correct clas- 
sifications are marked by grey boxes. The num- 
ber of correctly classified individual experiments 
relative to the sum of all experiments is given 
as percent accuracy. A model usually fits the 
training set of chemicals better than it will fit 
new chemicals. Taking this limitation into con- 
sideration, the percentage of correctly classified 
individual experiments is an over optimistic es-, 
timation. It must therefore be expected that the" 
accuracy of the model will decrease when com- 
pletely different chemicals are evaluated. 

For the evaluation of a PM the percentage of 
correctly classified individual experiments is not 
sufficient as the only index to describe the pre- 
dictive power of the discriminant functions. In 
addition, the rate of correct classifications ob- 
served should be compared with the rate of cor- 
rect classifications that can be expected just by 
chance. For three classes an a priori probabil- 
ity of 33% of correct classifications can be ex- 
pected merely by change. 

Evaluation of the EST with Six Chemicals of 
the Preliminary Phase. To evaluate the iPM, the 
six chemicals of the preliminary phase of the 
blind trial were classified according to the iPM. 
Complete sets of data were obtained from the 
four participating laboratories (I, J, K, and L). 
Results from a total of 48 individual experiments 
were included in the present analysis. 

The iPM discriminated sufficiently between 
the three classes of embryotoxicity, although it 



Table 3. Classification Results of the EST (Scholz et al., 1999) Using the Improved 
Prediction Model (iPM): (a) 93% of Single Experiments of the Training Set of Chemicals 
and (b) 79% of Single Experiments in the Validation Were Correctly Classified 



Embryotoxicity 
in vivo 



No. of test 
chemicals 



No. of 
experiments 



Predicted class 

in vitro 
1 2 3 



(a) Training set of 
chemicals 

(b) Evaluation with 6 
test chemicals 



non embryotoxic 3 
weakly embryotoxic 5 
strongly embryotoxic c 
non embryotoxic 
weakly embryotoxic 
strongly embryotoxic 



3 
3 
4 
2 
2 
2 



9 
9 
12 
16 
16 
16 



1 

0 



2 
1 



1 
6 



0 
0 

1! 
0 
0 



a Class 1. 
b Class 2. 
c Class 3- 
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was developed with the results of only ten test 
chemicals in a prevalidation study (Table 3a and 
Scholz et al., 1999). With the six chemicals of 
the preliminary phase, 79% of individual ex- 
periments were correctly classified (Table 3b), 
confirming the above statement, that a decrease 
in the correct classification rate must be ex- 
pected when chemicals different from the train- 
ing set are evaluated. 

MM test: Development of the Prediction Model 

Laboratories E and H submitted complete data 
sets. From the positive control experiments of lab- 
oratory F, which usually serve as a quality control 
of concurrent experiments, IC50 values could not 
be computed because the curves did not cover the 
relevant range of concentration response accord- 
ing to the acceptance criteria denned in the SOP. 
Therefore, the data of this laboratory were not eval- 
uated. For three of the six test chemicals tested in 
laboratory G, only one valid experiment wfes avail- 
able instead of two. Including the positive control 
experiments, a total of 33 individual experiments 
were assessed in the present analysis. 

In addition to the results of the preliminary 
phase of this trial, data from the prevalidation 
phase of the study were included to calculate 
the discriminate analysis (29 experiments of the 
prevalidation and 33 experiments for the 6 
chemicals of the preliminary phase of the vali- 
dation study). 



According to the recommendations of ECVAM 
for the prevalidation of alternative toxicological 
methods (Curren et al., 1995), PMs should be 
derived as one of the major objectives of a 
prevalidation study. During the prevalidation 
trial of this study, only three test chemicals and 
a positive and a negative control chemical were 
tested. However, a data set of five test chemi- 
cals is considered to be insufficient for ^devel- 
oping a PM. Furthermore, the chemicals tested 
during prevalidation exhibited an almost identi- 
cal potential for cytotoxicity and inhibition of 
differentiation. To calculate a PM that is able to 
discriminate test chemicals characterized by di- 
vergent potential for cytotoxicity and inhibition 
of differentiation, and also to increase the num- 
ber of test chemicals, results of the prevalida- 
tion and of the preliminary phase of the vali- 
dation study were included. 

The procedure for developing the PM was 
comparable to the procedure described above 
for the EST. Concentration response curves were 
analyzed for two endpoints, cytotoxicity deter- 
mined by NR staining and inhibition of differ- 
entiation determined by AB staining. Figure 3 
shows the results of a positive control experi- 
ment using 5-FU. The in vitro test provided a 
total of four experimental variables [Ig(IC5 0 X 
Ig(ID 50 ), relative distance between IC50 and ID 50 ; 
and ratio of IC 50 vs. 1D 50 ]; each of them may 
contribute to distinguish between classes of em- 
bryotoxic chemicals. Although no systematic dif- 



Neutral red: cytotoxicity 
Alcian blue: differentiation 
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FIG. 3. Endpoints of the MM-test. Diagram shows concentration response curves for the two endpoints 
of the MM test: cytotoxic effect determined by staining with Neutral Red (IC50) and inhibition of differen- 
tiation assessed by staining with Alcian Blue (IDso)- The curves were obtained from two single experiments. 
Endpoints are marked by arrows. 
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Table 4. Linear Discriminant Functions 
i, ii, and iii of the mm test 

Function I 6.65 * Ig(ID 50 ) - 9.49 

Function II 6.16 • Ig(ID 50 ) - 8.29 

Function III -1.31 * Ig(ID 50 ) - 1.42 



ference was apparent between cytotoxicity and 
inhibition of differentiation, a stepwise selection 
of variables was performed using analysis of dis- 
criminate. As a result, the logarithm of the con- 
centration that inhibited differentiation by 50% 
[Ig(ID5o)l was identified as an appropriate vari- 
able in the linear discriminant analysis. Using 
the variable selected for the MM test, three lin- 
ear discriminant functions are shown in Table 
4. The procedure for classifying chemicals ac- 
cording to the PM is described above. 

The results of the preliminary phase confirmed 
the observation from the prevalidation study that 
no significant difference was found between cy^ 
totoxicity data and inhibition of differentiation. 
Therefore, the twofold rule proposed as PM in 
previous studies and in the INVITOX protocol 
no. 114 (1996 and references therein), proved 
insufficient to predict the embryotoxic potential 
of test chemicals in the present study. 

The training set of chemicals provided 81% 
correct classifications according to the PM (Table 
5). The discrimination between non and weakly 
embryotoxic test chemicals proved to be diffi- 
cult because several of the chemicals were at 
the borderline between non and weakly em- 
bryotoxic. Remarkably, 100% of the strongly em- 
bryotoxic chemicals were classified correctly in 



the PM. As outlined above, a model usually fits 
the training set of chemicals better than it will 
fit new test chemicals. Thus, the percentage of 
correctly classified chemicals in the training set 
is an over optimistic estimation. The evaluation 
of the current PM has to await the results of test- 
ing the remaining 14 test chemicals and will be 
conducted at the end of the validation study. 

WEC test: Development of Prediction Models 

Compared with the two previous in vitro 
tests, the WEC test is a laborious test because 
a large number of parameters have to be as- 
sessed for each embryo. Four laboratories (A, 
B, C, and D) provided complete data sets for 
the six test chemicals and the positive and neg- 
ative controls. Hence, 32 experiments including 
positive and negative control experiments were 
submitted from the preliminary phase of the val- 
idation study. This data set was used to develop 
the PM. Data obtained during prevalidation 
were not used for the following reason: during 
the prevalidation study, the assessment of end- 
points in the WEC test was slightly different 
from the determination of endpoints in the pre- 
sent study, especially with respect to malfor- 
mations. Namely, during prevalidation, severely 
malformed embryos as well as embryos with 
functional defects, for example, missing heart- 
beat and missing circulation, were scored as 
"dead." Consequently, the number and speci- 
ficity of malformations was not assessed prop- 
erly during prevalidation and severely mal- 
formed embryos were not distinguished from 
embryos with defective heartbeat and circula- 



Table 5. Classification Results of the MM Test (29 Experiments of the 
Prevalidation Study and 33 of the Preliminary Phase of the Validation 
Study): 81% of the Training Set of Chemicals Were Correctly Classified 



Predicted class 





Embryotoxicity 


No. of test 


No. of 




in vitro 




in vivo 


chemicals 


experiments 


7 


2 3 


Training set of 


non embryotoxic 3 


4 


24 


17 


"7 0 


chemicals 


weakly embryotoxic 5 


3 


17 


5 


12 0 




strongly embryotoxic 0 


5 


21 


0 


0 21 



a Class 1. 
b Class 2. 
c Class 3. 
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tion. However, in the validation trial, numbers 
and specificities of malformations were 
recorded for all embryos, independent from 
functional parameters, for example, heartbeat 
and circulation. The term "dead," which was 
considered a poor endpoint providing insuffi- 
cient information, was not used in the valida- 
tion study. 

Using linear discriminant analysis two PMs 
(PM1 and PM2) were developed for the WEC 



test from data of the preliminary phase of the 
validation study. 

Prediction Model 1 (PM1X The in vitro test 
initially provided seven different experimental 
endpoints: 

* ICnoec f° r malformation = the highest^ con- 
centration that has no effect on the rate of 
malformations; ~ 
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FIG. 4. Endpoints of the WEC-test. Diagram (A) shows a concentration response curve for the three end- 
points for malformations of the WEC test. IC NO ec is defined as the highest concentration that has no effect 
on the malformation rate, IC50 is the concentration at which 50% of the embryos are malformed and IC- 
max is defined as the lowest concentration at which a maximum malformation rate is obtained. Endpoints 
are marked by arrows. Diagram (B) shows a concentration response curve for the TMS (total morpholog- 
ical score). The endpoint ICnoec is defined as the highest concentration that has no effect on TMS (marked 
by an arrow). 
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Table 6. Linear Discriminant Functions I, II, 

AND III OF PM1 OF THE WEC TEST 



Function I 18.08 * IgOCsoMal) - 11.56 * Ig(IC NOEC TMS) - 10.19 

Function II 21.55 * IgOCsoMal) - 15.31 * Ig(IC NOEC TMS) - 10.65 

Function III 8.70 * Ig(IC 5a Mal) - 8.53 * Ig(IC NOEC TMS) - 2.53 

TMS: total morphological score; IC NO ec describes the lowest concentration, that 
has no effect on the TMS; ICsoMal is defined as the concentration at which 50% 
of all tested embryos of the group are malformed. IC50 and IC N oec concentra- 
tions are calculated from concentration response curves by eyeball estimation. 



• the highest rate of malformations observed 

(%); 

• IC50 for malformations = the concentration at 
which 50% of the embryos are malformed; 

• ICnoec fc> r ™ s = the maximum concentra- 
tion that has no effect on the total morpho- 
logical score (TMS); 

• IC ma x for malformations = the lowest con- 
centration that shows a maximum rate of mal- 
formations; and ^ 

• IC50 for TMS and IC 75 for TMS - the con- 
centration at which the mean TMS is reduced 
to 50 or 75% of the control group of embryos; 

Each of these endpoints may contribute to dis- 
tinguishing among the three classes of embry- 
otoxic chemicals. As described above for the 
EST, the stepwise selection of variables was per- 
formed using analysis of discriminance. Two of 
the variables were accepted in the analysis: IC50 
for malformations and IC N oec f° r TMS (Fig. 4). 
Table 6 shows the PM1 calculated from data ob- 
tained in the preliminary phase of the valida- 
tion study with six test chemicals and the pos- 
itive and negative control. The classification 
criteria, which are used to discriminate between 
now, weakly, and strongly embryotoxic chemi- 
cals, have been described above for the EST. 



The classification of the training set of chemi- 
cals according to PM1 provided 72% correct clas- 
sifications (Table 8). Further evaluation of the 
PM1 will be conducted after testing the re- 
maining 14 test chemicals. 

Prediction Model 2 (PM2), Since PM1 takes into 
account only parameters of differentiation and 
development, but not measure of cytotoxicity, we 
tried to improve PM1 by including cytotoxicity 
data provided by the cytotoxicity test with 3T3 
cells obtained in the EST (IC503T3, Fig. 2). This 
approach has previously never been used as an 
adjunct to the WEC test. Therefore, the IC503T3 
of the EST was incorporated in the discriminant 
analysis. However, this variable was not used di- 
rectly in the analysis, but, in addition to the seven 
variables used to develop PM1, the relative dis- 
tance between the IC503T3 and the no effect con- 
centration of the TMS (IC NO Ec) of the WEC test 
was used as an additional endpoint. Thus, the 
endpoints applied in PM2 are: (1) the lowest con- 
centration at which a maximum malformation rate 
is obtained (ICmax for malformations) and (2) the 
relative distance between IC503T3 and IC NO ec for 
TMS (Figs. 2 and 4). 

Table 7 shows the linear discriminant functions 
for PM2. The classification criteria, which are used 



Table 7. Linear Discriminant Functions I, II, and III of PM2 of the WEC Test 

Function I 0.21 * IC ^ 3T3 ~ IC NOEC TMS # 1QQ + ^ m Ig(iCmax) a _ 23 58 

K-5(pI;> 

Function II 0.27 • IG TC 3T3 ~ "j™* ™ S . 100 + 17.71 • Ig(IC max ) - 32.37 

Function III 0.093 • IC ^T3 ~ IC. OEC TMS m 1QQ + ^ . Ig(IQnax) _ 4 23 

IC50:? I D 



a ICmax refers to the lowest concentration, at which the maximum rate of malformations is observed. 
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Table 8. Classification Results of the WEC Test (32 Experiments with the First Six Test 
Chemicals of the Preliminary Phase of the Validation Study Including Positive and Negative 
Control Experiment as Training Set of Chemicals) (a) Using PM1 72% of Test Chemicals Were 
Correctly Classified and (b) Using PM2 84% of Test Chemicals Were Correctly Classified 

Predicted class 





Em bryotoxicity 


No. of test 


No. of 




in vitro 






in vivo 


chemicals 


experiments 


1 


2 


3 


(a) PM1 


non embryotoxic 3 


3 


12 






0" 


Training set of 


weakly embryotoxic b 


2 


8 




1 


0 


chemicals 


strongly embryotoxic c 


3 


12 


0 


2 




(b) PM2 


non embryotoxic 


3 


12 




3 


"o 


Training set of 


weakly embryotoxic 


2 


8 


2 


I 


0 


chemicals 


strongly embryotoxic 


3 


12 


0 


0 





a Class 1. 
b Class 2. 
c Class 3. 

to discriminate between won, weakly, and 
strongly embryotoxic chemicals, have been de- 
scribed above. The classification of the training 
set of chemicals according to PM2 provided 84% 
correct classifications (Table 8). Interestingly, 
strongly embryotoxic chemicals were correctly 
classified at a rate of 100%. It has to be stressed, 
however, that these are classification rates ob- 
tained with the training set, and they still have 
to be confirmed and evaluated by additional test- 
ing in the definitive phase of the blind trail. 

DISCUSSION 

In the present study we report the first results 
of an ECVAM validation study of three in vitro 
embryotoxicity tests, the EST, the MM test, and 
the WEC test. As indicated above, to date no 
previous validation study on in vitro embry- 
otoxicity tests has been conducted under stan- 
dardized conditions implemented as strictly as 
in the present study. The procedure implies the 
transfer of SOPs developed during prevalidation 
to secondary laboratories. Prevalidation was fin- 
ished in 1998 (Scholz et al., 1998). 

The validation study is being performed es- 
sentially as proposed by the ECVAM Workshop 
on Practical Aspects of the Validation of Toxic- 
ity Test Procedures (Balls et al., 1995), and will 
be finished early in the year 2000. According to 
recommendations of the ECVAM workshop, the 
main purpose of a validation study is to con- 
duct a blind trial, as a basis for determining 



whether one or more tests can be shown to be 
relevant and reliable for the given purpose. For- 
mal validation studies should comprise a pre- 
liminary phase, in which a small number of 
coded test chemicals are tested as a "training 
set," followed by a definitive phase, and a final 
evaluation of the outcome of the study. In the 
preliminary phase, which is the topic of the pres- 
ent report, a training set of six test chemicals 
was tested under blind conditions in the three 
in vitro embryotoxicity tests. 

Despite the longtime use of established in 
vitro embryotoxicity tests (e.g., the MM test and 
the WEC test) for none of these in vitro tests 
has a sufficiently validated PM been developed, 
to allow screening of new or structurally related 
chemicals for embryotoxic potential in either the 
drug or the chemical industry. Taking into ac- 
count the results of the preliminary phase, bio- 
statistical PMs were developed for the MM test 
and the WEC test. This is particularly important 
because development of PMs is an essential pre- 
requisite for the formal validation procedure 
(Curren et al., 1995; OECD, 1996). 

It has to be stressed that the classification of 
test chemicals according to the PM in the WEC 
test and the MM test was based on results of 
the training set. These classification results pro- 
vide a highly optimistic estimation, and testing 
with a set of new chemicals will prove the real 
power of the PM. This assumption is supported 
by our investigation of the EST. The PM of the 
EST had been sufficiently defined prior to the 
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preliminary phase of the validation study (Scholz 
et al., 1999), and thus underwent its first eval- 
uation in the preliminary phase of the present 
study. Results obtained with the EST on the 
training set of chemicals in the prevalidation 
study provided 93% correct classifications. How- 
ever, the rate of correct classifications decreased 
to 79%, when tested with the preliminary set of 
6 test chemicals of the current validation study. 

Although a wide spectrum of developmental 
parameters is evaluated in the WEC test (i.e., 16 
parameters for general morphological develop- 
ment and 30 types of malformations) morpho- 
logical development (total morphological score) 
and the percentage of malformed embryos pro- 
vide sufficient information for predicting the em- 
bryotoxic potential of test chemicals in a PM. 
Nevertheless, the impact of the number and 
specificities of malformations of individual em- 
bryos determined in the WEC test will be eval- 
uated at the end of the study. 

During prevalidation of the MM test it became 
obvious that midbrain cells do not provide data 
that are reproducible in an acceptable manner. 
Therefore, only limb bud cells were used in the 
MM test in the formal validation study. Further- 
more, the concentration response curves of the 
differentiation of limb bud cells monitored by 
AB staining and cytotoxicity assessed by NR up- 
take were almost identical. Subsequendy, in the 
PM or the MM test only the differentiation of 
limb bud cells was considered because it 
showed a higher predictive power than the cy- 
totoxicity to discriminate test chemicals accord- 
ing the three classes of embryotoxicity. 

Some of the test chemicals could not easily be 
assigned to the groups of weakly and nonem- 
bryotoxic chemicals in all three of the in vitro em- 
bryotoxicity tests. This is most probably due to 
the limited number of test chemicals and by spe- 
cific characteristics of these compounds. A com- 
prehensive discussion of limitations of the three 
in vitro embryotoxicity tests including an evalua- 
tion of the effectiveness and accuracy of the PMs 
will be given by the end of the validation study. 
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